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Abstract 



Symmetries occur naturally in CSP or SAT problems and are not very 

difficult to discover, but using them to prune the search space tends to be 

very challenging. Indeed, this usually requires finding specific elements 

in a group of symmetries that can be huge, and the problem of their 

very existence is NP-hard. We formulate such an existence problem as a 

constraint problem on one variable (the symmetry to be used) ranging over 

a group, and try to find restrictions that may be solved in polynomial time. 

I ' ' By considering a simple form of constraints (restricted by a cardinality 

. _: ' k) and the class of groups that have the structure of Fp-vector spaces, 

_^ , we propose a partial algorithm based on linear algebra. This polynomial 

• ' algorithm always applies when k = p = 2, but may fail otherwise as 

^_ I we prove the problem to be NP-hard for all other values of k and p. 

Experiments show that this approach though restricted should allow for 
an efficient use of at least some groups of symmetries. We conclude with 
a few directions to be explored to efficiently solve this problem on the 
general case. 



keywords: symmetries, linear algebra, complexity 

1 Introduction 

Symmetries are permutations of input symbols that, when applied to an instance 
of a computational problem, leave its solution invariant. Since naive algorithms 
reproduce these symmetries in their search space, it is tempting to use them as 
a pruning device. A typical example is the pigeon-hole problem in propositional 
logic: its symmetries show that any pigeon can be swapped with any other (and 
similarly for the holes), and therefore play equivalent roles. 

Since invariance is stable by composition, the set of symmetries of an instance 
forms a permutation group. This means that the information they provide has 
a mathematical structure that should yield nice computing properties (even if 



they occur only at the meta- level). In fact, some techniques from computational 
group theory have been employed to discover symmetries and to use them. 

However, it seems that algorithms can use symmetries in a straightforward 
way only if their search space preserves the structure of the group of symmetries 
in some sense (subtrees should somehow correspond to subgroups). This is 
usually not the case for the most efficient algorithms developed in the field 
of AI. Hence many different methods have been developed in order to prune 
the search space with symmetries, either by designing special algorithms or by 
modifying instances through symmetry-breaking. Much work has been devoted 
to this subject, see e.g. [T^] and the references therein. 

One feature common to these methods is that they ideally assume the ability 
to produce symmetries that have particular properties, suitable for the pruning 
scheme. But this problem also happens to be NP-hard, which explains why 
symmetries that do not result in the best pruning may be used. The time 
spent on searching and using symmetries does not always pay off. This suggests 
that the group structure may not be sufficient to induce enough computational 
properties to ensure efficient pruning. 

Our aim in this paper is to investigate ways of finding suitable symmetries 
in polynomial time. To this purpose we first formulate this search problem by 
a language of constraints as simple as possible. This is the topic of Section [31 
We then consider restrictions of this search problem that confer deeper math- 
ematical structure to the groups. The idea is to transform the constraints into 
linear equations, which would then be easy to solve by means of basic computer 
algebra. To the best of our knowledge, although restrictions to vector spaces 
have already been considered in the literature, such a transformation represents 
a novel approach. This means that we need to assume that the groups are 
also vector spaces, and to develop ways of efficiently working with symmetries 
as vectors (i.e., essentially of computing their coordinates in a suitable basis). 
This is developed in Sections H] and [H leading to a polynomial algorithm that 
solves so-called linear constraints. We will then see in Section [5] that, even with 
the simple constraints and vector spaces, the search problem remains NP-hard 
in most cases. Experiments in Section [7] illustrate the efficiency of this polyno- 
mial algorithm on random samples of linear constraints, compared to a general 
purpose algorithm. We suggest in the conclusion a few directions for using this 
approach in a wider setting. 

2 Definitions 

We do not recall the most basic definitions and notations from group theory or 
linear algebra, such as cycles or bases, which can be found in standard textbooks, 
e.g. [3], except in order to settle notations. 

Given a finite set A, we denote by Sym(>l) the group of permutations of A. 
If 5i, . . . , gm are permutations of A, we denote by \gi, . . . , gm] the subgroup of 
Sym(A) generated by these permutations. For a G A and g,g' S Sym{A), the 
image of a by g is denoted by a^, and the composition of permutations g' o g 



by gg' , so that a^^ — {a^Y . From a computational point of view, it is obvious 
that the product gg' can be performed in time linear in \A\. The order of g is 
the smallest positive integer n such that g" is the identity. 

Let G be a permutation group on A, the orhit of a in G (or G-orbit of a) 
is a'^ = {a^ \ g G G}. The set of G-orbits forms a partition of A, denoted by 
OV{G). The group G is transitive if it has only one orbit (i.e., OV{G) = {A}). 
It is easy to see that OVilgl) can be obtained from the cycles of g, e.g. if 
A = {1, . . . , 6} then OVi\{l 2)(3 4 5)1) = {{1, 2}, {3,4, 5}, {6}}. 

The refinement order on partitions of A (P C P' iff VO £ P, 30' e P' s.t. 
O C O') is a complete lattice; the least upper bound P U P' is obtained by 
merging the non-disjoint elements of P and P'. The smallest partition is J-a = 
{{a} \ a £ A} and the greatest is Ta = {A}. Given two permutation groups 
G and G' on A, OV{\G U G']) = OP(G) U OP(G') (see [fl chap. 7]). Hence, 
starting with m generators gi,. . . ,gm the orbit partition OV{\gi, . . . ,5m]) = 
V!k=i ^T^{\9i\) can be computed in time polynomial in m and |^|. 

A group G is an elementary Abelian p-group if it is Abelian and its non-trivial 
elements have order p, a prime number. It is simple to test this property on 
the generators of a group: G is an elementary Abelian p-group iff its generators 
commute and have order p. If this is the case we adopt the additive notation, 
e.g. (a b) + {c d) = (a b){c d), 2{a b c) — {a b c)^ = {a c b) and is the 
identity. By considering multiplication by an integer as an external product 
on the set of integers modulo p, we confer to G the structure of an Fp-vector 
space. Conversely, every Fp-vector space is isomorphic to an elementary Abelian 
permutation p-group. 

Since the class of elementary Abelian p-groups is closed under homomorphic 
images and for all O € OV{G) the restriction to O operator is a morphism from 
G to Sym(O), if G is an Fp-vector space then so is its image G\o = {g\o 1 .9 ^ G} 
(although it may not be a subgroup of G). The groups G\o are the transitive 
constituents of G. 

In the sequel, unless stated otherwise, a,b,c denote members of A, u,v,w 
denote vectors (i.e., permutations on A), h and f denote bases of vector spaces, 
and the Xi^s coordinates in vector spaces (i.e., integers modulo p). 

3 Constraints on symmetries 

As mentioned above, techniques designed to compute generators for a group of 
symmetries are well-known. Basically, they consist in building a graph with the 
elements to be permuted as vertices, and with edges and labels that depend on 
the instance, so that its automorphism group is the expected group of symme- 
tries. Building this graph is usually straightforward and its automorphism group 
can then be computed with e.g. the well-known program nauty [8l . Though not 
polynomial, this algorithm has a low average complexity and is very efficient. 

We thus assume given a group of symmetries that is specified by a generating 
set of m permutations gi , . . . , g^ of a set A. The group G = \gi, . . . , g^l is the 
set in which a permutation g satisfying a given constraint is searched for. In 



this section we define a language for expressing constraints on g that is both 
useful and simple. We begin with an example. 

We consider the problem MMC from [5]. Given a model M of a formula 
built on prepositional variables which are linearly ordered (say, x < y < z), 
^M being the interpretation defined by ^M{a) = M{a^), and given a group G 
by its generators, the problem MMC consists in deciding whether there exists a 
g & G such that ^M < M (interpretations are ordered lexicographically). This 
problem is required for computing symmetry-breaking predicates, and is shown 
in [2] to be NP-complete. We can translate ^M <M as: M{x3)M{y3)M{z3) < 
M{x)M{y)M{z), and then as 

M{x3) < M{x)y 

mIxS) = M{x) A [ M{y9) < M{y)y 

M{yS) = M(y) A M{zS) < M{z)]. 

A translation into disjunctive normal form yields: 

M{x3) < M{x) 

V M{x3) = M{x) A M{y9) < M{y) 

V M{x3) = M{x) A M{yS) ^ M{y) A Af (z^) < M{z). 

If X denotes {a \ M{a) < M{x)} then the first disjunct translates into x^ G X. 
Similarly, let X' = {a\ M{a) = M{x)} and Y = {a\ M{a) < M{y)}, the second 
disjunct is x^ G X' Ay^ £ Y, etc. Thus the problem MMC can be expressed by 
means of boolean combinations of atomic constraints of the form x"' 6 X, where 
a is the (unique) variable ranging in G. Note that the negation of any atomic 
constraint can be expressed as an atomic constraint|^ (with the complement set), 
hence negation can be ruled out from the language. Since the set of solutions of 
a disjunction is the union of the solutions of each disjunct, we focus on solving 
conjunctions of atomic constraints. 

Definition 1 We address the computational problem GC, for Group Constraint, 
that takes as input a set A of cardinality n, permutations gi, ■ ■ ■ , gm of A, and 
a constraint Lp which is a conjunction of atomic formulas of the form x" G X , 
where a is a unique variable and x £ A, X C_ A. The decision problem GC 
consist in checking for the existence of a permutation g £ [gi , . . . , gm~\ that 
satisfies all the conjuncts in ip (a conjunct x"^ £ X is satisfied by g if x^ £ X). 
The associated search problem is the computation of such a g if it exists. 

A number of transformations on the constraint that preserve the set of so- 
lutions can be applied. These are: 

x"" e X A x'" e y — > x" gx^y, 

Lp — > (fi A x"^ <E x^. 

The correctness of the first transformation is obvious and that of the second 
trivial since x^ £ x'^ holds for all g £ G. By applying these transformations 



^This is no longer true if we break down x^ £ {xi, . . . , Xn} into x^ = xi V . . . V x^ = Xn 
hence atomic constraints of the form x^ = y would not necessarily be simpler to handle. 



systematically (together with associativity-commutativity of conjunction), as- 
suming that A — {«!, . . . , a„} it is always possible to transform any given con- 
straint into an equivalent constraint of the form a^ G Ai A . . . A a^ € An, where 
Ai Q af for all 1 < z < n. Computing this normal form is linear in the length 
of (/3 and 1^1 . 

This normal form may be represented as a function C from yl to 2 , where 
C{ai) — Ai. The problem GC then consists in finding a g e G such that Va G 
A,a^ g C{a), and we may consider C directly as an input of the problem, so 
that an instance of GC is a tuple {A,gi, ... ,gm,C) (or {A,G,C) in short). We 
say that C is a k-constraint if Va S A, \C{a)\ < k. 

As mentioned above, in the sequel we only consider, for every prime p, the 
restriction of GC to the instances where G = [gi , . . . , gm\ is an elementary 
Abelian p-group, i.e., an Fp-vector spaco 

4 From permutations to linear algebra 

Before giving the technical details of the transformation from a problem on 
permutations to a problem in linear algebra, we briefly summarize the way we 
proceed. 

Since our approach is based on the transitive constituents G\o of G (for 
O G OV{G)), we first define a vector space F that contains as subspaces both 
the group G and the G|o's. We then prove a fundamental property of the 
Glo's and use it to realize a polynomial test of linear dependence restricted to 
these subspaces. This is used to compute a basis of each G\q and to obtain the 
coordinates of any u G G\q in this basis, in polynomial time. This directly yields 
a basis of F and the coordinates of any u G F in this basis. It is then standard 
to transform any linear variety of F, such as G, as a set of linear equations using 
basic linear algebra. Section [5] will be devoted to applying these techniques to 
the constraint C. In the sequel we write P for the orbit partition OP{G). 

4.1 The super-space F 

The groups G and the G|q's are of course all included in Sym(v4), but Sym(v4) 
is not elementary Abelian hence not a vector space. Let F be the permutation 
group on A generated by the transitive constituents of G, i.e., F = [[Jp^-p G|q] . 

Lemma 1 F is an ¥p-vector space that contains G and F = hIh G\q. 

oev 

Proof. F is generated by the union of generating sets of the groups G|o, i.e., 
by the set {gi\o \1 < i < m,0 E P}. Since orbits are mutually disjoint and 



■^Note that solving such constraints is much simpler than computing any lex-leader formula 
as in [7], where restrictions to vector spaces are already considered. Our constraints are 
therefore not relevant to the problem of building lex-leader formulas, and are not meant to be 
used in the symmetry-breaking scheme of [2]. 



permutations on disjoint sets commute, gi\o9j\o' — 9j\o'9i\o ^i O =/= O' . Fur- 
thermore gi\ogj\o = {9i9j)\o = {9o9i)\o = 93\o9i\o since G\o is Abelian. Hence 
F is Abelian, and similarly its non-trivial elements have order p. 

We therefore use the additive notation in F, which yields F = X^oeP^lo- 
For any O ^V^let A' = A\0 and Fq = ^o'^o ^\o': since Fq is a subgroup 
of Sym(^'): then G\o n Fq C Sym(O) n Sym(yl') = {0}. This proves that F is 
the (internal) direct sum of the GIq's, which is written F = 0og-p G\o- 

It is clear that G is a subspace of F since any u G G can be written as 
u = X^oG'P ^lo S'Hd hence belongs to F. O 

We call F the super-space of G. Since the sum in Lemma [1] is direct, the 
decomposition of any vector w in i^ as a sum of elements of the G|o's is unique, 
the dimension rf of F is the sum of the dimensions do of G|q for O e 7-", and a 
basis for F can be obtained by concatenating bases for the subspaces G\q. 

4.2 Orbits as afRne spaces 

For all O gV , the transitive constituent G\o is of course transitive in O. This 
trivial fact yields a fundamental property: 

Lemma 2 (0,G|o) is an affine space. 

Proof. We define the external sum + : Ox G\q — >■ O, for all a £ O and u G G\o, 
hy a + u = a^ (the image of a by u) and prove that the two axioms of affine 
spaces hold. For all v G G\q it is clear that 

{a + u)+v = (a")" = a("+") =a+{u + v). 

Consider (j)a : G\q — > O such that (paiu) ^ a + u, we prove that 0a is bijective. 
It is obviously onto since G\q is transitive on O: V6 £ O, 3u G G\q such that 
b = a" = <f>a{u). Assume now that (t>a{u) = 4>a{v), i.e., a^ = a" , then for any 
6 G O, if w G G\q is such that a™ = 6, then 

hence u = v, and (f>a is injective. O 

This is nothing more than a geometric interpretation of a known result: that 
transitive Abelian groups are "regular" (see [TOl theorem 10.3.4]). This entails 
that \0\ = |G|o|, and that the external sum is regular, i.e., an equality a + u = b 
determines every term from the two others. In particular the unique vector 
u G G|q such that a + u = 6 is usually written u ^ b — a (or oo), and we also 
write b — a — u for b — a + (— u), which is equivalent to b + u = a. Note that 
there is one external sum (and difference) per orbit, but since they are disjoint 
it is unambiguous to denote them with the same symbol. 

In the sequel we use the additive notation on sets S, S' of vectors or elements 
of an orbit, i.e., if £ G {-I-, — } then SeS' — {ses' | s G 5, s' G S'}. If one set is a 
singleton, say S = {s}, wc write seS' for {s}eS'. This is of course compatible 
with the notations already used when S and S" are subspaces of F. 



4.3 Computing a basis for F 

Since G\q is a finite Fp-vector space, its cardinality must be p"^" . But this 
cardinality is also that of O which is known, hence the dimension dp can be 
computed: do = log^ \0\. Furthermore, since gilo, . . . , ^mlo is a generating set 
for Glo, it is possible to extract a basis fo oi G\q from this set by discarding 
m — do linearly dependent vectors. For this a test of linear dependence is 
required. 

Lemma 3 For any subspace H of G\o, any u £ G\o and any a £ O, 

ueHiffa^e a". 

Proof. By regularity of the external sum, uEHiSa + uEa + H. By definition 
a + u = a'^ and a + H = {a + v\v e H} = {a" \v e H} = a" . O 

Linear dependence is therefore reduced to computing the orbit of an arbitrary 
point a G O. A basis of G\o can be built step by step by computing the 
corresponding orbit partition Q of O, with the following function B: 

B([],h, Q) = h, 

B(M@/, h, Q) - B^, h, Q) if a" e a[Q], 

= B{1, [u\@h, OV{u) U Q) otherwise. 

Here I is a list of vectors, @ is the concatenation of lists and [ ] is the empty 
list. 

Lemma 4 B([gi|o, . . . ,5,„|o], [ ], ±o) is a basis of G\o- 

Proof. Let fo = B([(7i|q, . . . ,5„i|q], [ ], J-o)- Since _Lo is the orbit partition 
of the trivial group {0} on O, the invariant Q = OPdh]) is maintained 
throughout the computation. This means that the class a[Q] of a modulo Q is 
the orbit a^^'^ , and that a" G cl[Q] is equivalent to u G [h] by Lemma [31 Hence 
u is added to h if and only if it is linearly independent from the latter, which 
proves that h remains free. This also proves that [Z@h] is invariant, hence 
[fo] = [siIoj • ■ • j ffmlol = Glo ^^'i fo is free, it is thus a basis of G\o- O 

Example. Let O — {1, ... ,8} and 

g, = (12)(3 4)(5 6)(7 8), 

32 = (15)(2 6)(3 7)(4 8), 

33 - (13)(2 4)(5 7)(6 8), 
We choose a = 1, then 

fo = ^{[91,92,93], [ ], -Lo) 

= B([52,33], [51], OV{g,)) since l^^ = 2 ^ l[±o] = {1} 

= B([53], [52,31], OP{g2)UOVigi)) since P= = 5 ^ l[OP(.gi)] = {1,2} 

= B([ ], [33,52,51], To) since P^ = 3 ^ 1[0^(32) U OVigi)] = {1, 2, 5, 6} 

= [53,52,51]- 
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Of course the algorithm can be interrupted once h has do elements (or 
equivalently when Q — To). Building io requires at most m recursive calls and 
the computation of exactly do < fn orbit partitions of O, each being polynomial 
in \0\ < n, hence fo is computed in time polynomial in n and m. 

The bases fo can be concatenated (in an arbitrary order) to form a basis f 
of F. The length d of this basis may be greater than to, but since 



do = logJOl < 



\o\ 

101 



necessarily 

d ^ Y.'^o < J2 

oep oev ^ ^ 

hence computing f is again polynomial in n and m. 

4.4 Computing coordinates in the basis for F 

The coordinates of any vector w e _F in basis f can be obtained by computing, 
for all O & P, the coordinates of u|o G G\q in the basis fo, and by concatenating 
these coordinates in the same order as the one used to build f . We show how 
to compute the coordinates in fo of the permutations in G\o- 

Since fo is a basis of G\q, there is a 1-1 correspondence from the tuples 
{xi, . . . ,Xdo) G (]Fp)'^° to the elements of G\q, given by 'J2i=i ^ihi, which can 
be computed in polynomial time: each Xihi requires composing Xi — l < p times 
the permutation hi of O with itself, hence computing a permutation from its 
coordinates in fo can be computed in time 0{dop\0\), which is bounded by 
O(nlogn) (since p is a constant). 

Computing the coordinates in fo of a given u G G\q means computing the 
inverse of the previous correspondence. This can be performed by browsing 
through all possible values {xi, . . . ,Xdo) G {^pY" until ^i°iXihi equals u. 
Since \0\ — \G\q\ — p''°, this requires at most \0\ iterations, hence can be 
computed in time 0{dop\0\'^), which is bounded by O(n^logn). 

The same technique may be applied if u is given as 6 — a, where a,b € O. In 
this case it is necessary to check whether b — a + ^^^^ Xihi, i.e., whether b is the 
image of a by the permutation X]i=i ^if^i- The complexity is thus the same as 
above. Note that this also allows to compute b — a explicitly as a permutation. 

From a practical point of view we should avoid repeated computations of 
permutations from coordinates: this could be done by storing values in a suitable 
array. Using our geometric interpretation, we choose arbitrarily an origin a € O 
and define the coordinates of any point b G O as those of the vector b — a 
relative to fo. We can therefore fill an array that associates its coordinates to 
each entry b € O, hy browsing through all possible coordinates as explained 
above. Since this array has \0\ entries, filling it takes polynomial time. Then, 
given a permutation u, we need only compute the image 5 = u" and pick the 
coordinates of b in the array; these are the coordinates of u. 



In the sequel we write f = /i, . . . , /d, and for any u G F, ii u — J2i=i ^ifi 
where the Xi G Vp are the coordinates of u in f, we write Uf = *(xi • • • Xd) the 
column matrix of these coordinates. 

4.5 A characterization of linear varieties in F 

Since the super-space F is isomorphic to the vector space {¥pY, and this iso- 
morphism can be computed (through the coordinates in f ) in both directions, 
computations with matrices can be substituted for computations with permu- 
tations. This means that standard algorithms from linear algebra apply, in 
particular Gaussian elimination. Note that exact computations can be per- 
formed in Fp, including division, in time at most quadratic in the number of 
bits (see, e.g. [51 p. 117]), hence in constant time in the present context. 

In particular, it is now straightforward to test whether a family oi I < d 
vectors ui, ... ,ui ^ F is linearly dependent, by first computing the coordinates 
Ui = '^j^iXijfj and then by performing Gaussian elimination on the I x d- 
matrix (xij) (which requires a number of operations at most cubic in d). The 
family is linearly dependent iff Gaussian elimination yields a zero row in the 
resulting matrix (the number of non-zero lines after Gaussian elimination is the 
rank of the matrix, i.e., the dimension of the space \ui, . . . , ui~\). 

Assume we are given a linear variety v + H oi F hy the permutation v and 
a generating set for the subspace H. We can compute the coordinates in f of 
these permutations and, using the previous procedure, extract a basis hi, ... , hd' 
of H from the generators of H, where d' is the dimension of H. The vectors 
hi, ... , hd' together with the vectors in f form a generating family of F; the 
free family hi, ... , hd' can therefore be completed into a basis h = hi, . . . ,hd 
of F by adding d — d' vectors taken from f . If P denotes the matrix whose i"^ 
column is {hi)f then P is the change of basis matrix from h to f : Uf = Pu^ for 
all u G F. This matrix is invertible and its inverse can be computed using the 
Gauss- Jordan algorithm in time cubic in d. 

This means that the coordinates of any vector w in h (or in any basis) can 
be computed in polynomial time through u^ — P~^Uf. Membership of u to the 
subspace H may be checked simply by making sure the last d~d' coordinates of 
Uh are equal to zero. This can be expressed by building the diagonal dxd-matrix 
D with ones on the last d — d' positions of the diagonal and zeroes elsewhere: 

\Q I 

where / is the {d — d') x (d — d') identity matrix. Thus vector u ^ F belongs to 
H iff -Dwh = 0. Let J^h ^ DP~^, we have shown that: 

Lemma 5 From any linear variety v + H of F can be computed in polynomial 
time a d X d-matrix A4h such that \fu G F, 

u ev + H iff M-HUf = MhV{. 



Conversely, it is well known that the set of solutions u of any system of linear 
equations on d unknowns (the coordinates of u in f ) is either empty or a linear 
variety of F . 

5 Solving linear constraints 

The vector space G being a linear variety of F by Lemma [U its elements are 
characterized as the solutions u of a system of linear equations Adc^f = 0, as 
shown in Lemma [S] (by taking v = 0). We now investigate how to characterize 
the constraint C by another system of linear equations. 

5.1 Constraints as sets of vectors 

The first step towards this characterization is to explicitly represent the set of 
vectors u & E that satisfy a constraint C. Let Vb = || C{a) — a for all O gV. 

aeo 

Lemma 6 A vector u ^ F satisfies C iff u E y Vq- 

oev 

Proof. Assume a vector u £ F satisfies the constraint C, i.e., for all a e A, 
a" e C{a). Then for all O in the orbit partition V and for all a G O, since 
a" = a"lo G O and (O, G|q) is an affine space, we may write a + u\q G C{a), 
and since C{a) C O, this is equivalent to u\o G C{a) — a. But this is true for 
all a G O, hence u\q G Vq- Since u = X^oeP^IO' vector u must belong to 

Conversely, let u G J^oev^o and let a £ A. If O = a'-^ then u\q G Vq, 
hence in particular u\q G C{a) — a, i.e., a" — a + u\q G C{a). Hence u satisfies 
the constraint C. O 

The sets Vq can be computed for each O E V hy selecting an arbitrary 
a € O and computing the coordinates of c — a in fo for all c G C{a), and this 
operation is polynomial in n for each c as mentioned in section 14.41 Provided C 
is a fc-constraint this yields at most k elements in C{a) — a. Then, Vq is the set 
of those vectors u from C{a) — a that also belong to C{b) — b for all 6 G O \ {a}, 
i.e., such that 6" G C(6), which requires that u be also computed as an explicit 
permutation. Computing the coordinates of all the elements of Vq is therefore 
polynomial in n and k. 

5.2 Linear constraints 

The size of the set J2oev ^o is Ooe-p \^o\- If one of the sets Vq is empty, then 
obviously constraint C is unsatisfiable. But in general this set is exponential 
in the number of G-orbits (and therefore in n, the worst case being 2t with ^ 
orbits of size 2). This motivates the following definition. 
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Definition 2 C is a linear constraint if > Vq is either empty or a linear 

oev 
variety of F. 

In order to clreck whetiier a constraint C is linear or not, we provide a simple 
characterization of this property. For O G 7^, let wq be an arbitrary element of 
Vo and Eq = Vq — wq ■ 

Lemma 7 >, ^o is a linear variety of F iffiO G V, dimfii^o] — log„ |Vb|- 
oev 

Proof. Since F — 0oe-p ^lo ^"^^ ^o Q G\q, it is clear that '^oev'^o is a 
linear variety of F iff Vq is a linear variety of G|o for all O G "P. If this is 
so then Eq is a subspace of G\q that does not depend on wq- Hence this is 
equivalent to all the Eq^s being subspaces, hence to \Eo\ — Eq- Again this is 
equivalent to pdimfBol = | ^Eq] | = |£^o| = |Vb|- O 

The dimension of \Eo\ is the rank of the matrix formed by the coordinates 
of the vectors in Eq, which can be computed in polynomial time and compared 
to logp|Vo|- Computing this rank is of course useless if logp|Vb| is not an 
integer. Hence the linearity of C can be tested in time polynomial in n (since 
\'P\ < n). Note that the case where Vq is a singleton corresponds to Eq being 
reduced to the trivial subspace {0} of dimension 0. 

Let w = Yloev ™o and E = ^pev ^O for every O G "P, so that J2oev ^o 
is the linear variety w + E (assuming it is not empty). 

Theorem 8 IfC is linear then the problem GC is equivalent to a system of linear 
equations on coordinates of a solution in t, and this system, can be computed and 
solved in polynomial time. 

Proof. If J^oev ^o ~ 9 then the instance {A, G, C) of GC has no solution, which 
is equivalent to the linear equation = 1. Otherwise X^oe'P Vo — w -^ E and 
by Lemma [6] any m G F is a solution of the instance {A, G,C) iS u ^ G and 
u E w + H, which is equivalent by Lemma [5] to 

McUf = 
MeU{ = MsWf. 

This is a system of 2d linear equations on d unknowns, it can be solved by 
Gaussian elimination in time cubic in d. O 



Corollary 9 The problem GC restricted to p = k = 2 is polynomial. 

Proof. Every non-empty Vq has at most fc = 2 elements. If Vo = {u} = u + {0} 
then dim[i;o] = dim{0} = = logj \Vo\; if Vo = {u,v} = u + {0,v - u} 
then dimfiJo] = dim[w — u] = 1 = log2 |Vb|, hence according to Lemma [7] the 
constraint C is linear. O 

This result holds both for the decision and the search problem. 

11 



6 NP-Completeness results 

6.1 Constraints with more than 2 elements 

If one Vo has more than 2 elements then C may not be a hnear constraint, 
and therefore the previous polynomial algorithm may not apply. In fact, we 
now prove that allowing constraints of a cardinality greater than 2 makes the 
decision problem GC NP-hard, whatever the value of p. We proceed by reduction 
from the problem of 1-satisfiability of positive fc-clauses. 

Let S be a finite set of propositional variables (which will be denoted by 
Greek letters), a positive k-clause is a subset C C S of cardinality k. Let S be 
a finite set of such clauses, then S is 1-satisfiable if there is an interpretation 
/ C S such that every clause C & S contains exactly one element in /. 

Given E and S, we build an instance of the decision problem GC (restricted 
to Fp-vector spaces) whose satisfiability is equivalent to the 1-satisfiability of S. 
Furthermore, the construction is polynomial in the size of the input clause set, 
i.e., it is polynomial in |S| and \S\ (not necessarily in k, which is a constant). 
This transformation consists in interpreting propositional variables and clauses 
inFp. 

We consider the space of functions from S to ¥p, written F^, with the 
standard sum {u+v){a) — u(a)+v{a) and the external product {xu){a) = xu{a) 
for all a S S, u,v G F?" and x G Fp. This is an Fp-vector space of dimension 
|S|, with generating set {5/3 \ (3 e E}, where (5^(a) ^ 1 ii a ~ /3 and otherwise. 

We construct a permutation group as a homomorphic image of the elemen- 
tary Abelian p-group F^ . The elements to be permuted are those of the set 



^s = l+J F^ 



ces 

of functions from C to Fp, where C is a positive fc-clause belonging to S. The 
cardinality of As is X^cgs^'*^' ~ p'^l'^l- Each element of As is a function 
that associates integers modulo p to k propositional variables, hence it can be 
encoded in constant size. 

Example. We consider the set E — {a,/3,7} and assume that p ~ 2. There are 
exactly 8 functions from E to F2, which we denote according to the following 
scheme. 

S 50 51 52 53 .94 55 56 .97 
a 1 1 1 1 

/3 1 1 1 1 

7OIOIOIOI 

Let C — {a, j3, 7}, which is a positive 3-clause on E, and S — {C}. Since C = E, 
we have 

As = l+J Ff = F^^ = Ff = {50,..., 57}. 

C'GS 

O 
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Let / denote the function from F^ to 871x1(^5) defined for all u G F^ , C € S 
and w G Fp , by w^'^'^^ = w + u\c- It is straightforward to verify that /(u) is 
indeed a permutation of As, and that f{u)f{v) = f{u + v); hence / is a group 
morphism and the group Gs generated by the permutations {f{Sa) | a £ S} is 
an elementary Abelian p-group. This generating set can obviously be computed 
in time polynomial in |S| and \As\. 

Example. We compute figs). For all w G Ff, ly-^^s^) = w + g^. We have 
go + .93 — .93 5 5i + .93 = .92, etc. and we easily obtain, in cycle notation 

figs) = {go gs){gi g2){gi g7){g5 go)- 
o 

Finally, let Cg be the fc-constraint on Gs defined for all C G 5* and w G F^, 

by 

Csiw) = {w + {So.)\c\aeC}. 

Example. Since Sc{ct) = 34, (5c(/3) — 52 and Sc{j) = 5i, we have for all w G Fg*, 
Cs{w) = {w + g4, w + g2, w + gi}. This yields for instance Csigs) = {ff7, gi, 52}- 
O 

Lemma 10 S is 1-satisfiable iffCs is satisfiable in Gs. 

Proof. First assume that S is 1-satisfiable, i.e., there is an interpretation / C E 
such that every clause G G S has exactly one element in /. Let u G F^ be defined 
by u{a) = 1 if a G / and otherwise. For C G 5, if {a} = G Dl then it is clear 
that u\c = (^a)lc: so that for aU w G FJ^, w^'^"^ = w+u\c = w + iSa)\c £ Csiw). 
Thus f{u) G Gs satisfies Cg. 

Conversely, suppose there is an element f{u) of Gs that satisfies C5, let 
/ = {a G S I u{a) = 1} and let G be any clause in S. For all w G F?', since 
^(;/(f) g Csiw) there is an a G C such that w^'^"'' — w + ((5q)|c hence such that 
ulc — (^q)Ic- Necessarily «(«) = 1 and therefore a £ G H I. If/3GCn/we 
similarly obtain u\(j = (<5^)lc:: hence /3 = a. This proves that GClI is a singleton 
for all C G S', and that / 1-satisfies S. O 



Theorem 11 For anyprim,ep, the problem, of solving k-constraints inWp-vector 
spaces is NP-complete if k > 3. 

This follows from the NP-completeness of the problem of determining 1- 
satisfiability of a set of positive 3-clauses (see [4j problem L04, p. 259]). 

6.2 Constraints with at most 2 elements 

If a Vo has exactly two elements but p > 3, then Vq cannot be a linear variety 
of F and the algorithm of Section [52] necessarily fails. We prove that allowing 
p > 3 makes the restriction of the decision problem GC to Fp-vector spaces 
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NP-complete, even with constraints of cardinality at most 2. We proceed by 
reduction from 1-satisfiability of positive p-clauses. 

Given a set S of positive p-clauses on S, we again consider the Fp-vector 
space Fp and define the set 

A's = E X Fp ttl 5 X Fp, 

whose cardinahty is p|S| +p|5|. Let /' be the fmiction from F^ to Sym(A^) 
defined for all u e F^ , {a, x) £ E x Fp and (C, y) e S x Fp, by 

{a,x)^'^'''> = {a,x + u{a)), 

pec 

It is straightforward to verify that /'(u) is a permutation of A'^: if (a, x}^ *^"-' = 
{a',x'y''^''^ then a = a' and then x = x'; if (C,y)-'''(") = {C ,y')f'^'^^ then C = 
C" and then y = y' . It is obvious that f'{u)f'{v) = f'{u + v), hence /' is a group 
morphism and the group G'g generated by the permutations {f'{Sa) | a G E} is 
therefore an elementary Abelian p-group. This generating set can be computed 
in time polynomial in |S| and \A'g\. 

Example. We assume the same E, p, C and 5" as in the running example of 
Section EH Then 



A's = {(a,0), (a,l), (/3,0), (/3,1), (7,0), (7,1), (CO), (C,l)}. 

The permutations f'{u) for u G Fj can again be expressed in cycle notation, 
for instance: 

/'(52) = ((/?,0) (/3,1))((C,0) (C,l)), 
/'(53) = ((/?,0)(/3,l))((7,0)(7,l)). 

(C, 0) is a fix-point of /'(ga) since 53(a) + 53(/3) + 53(7) = + 1 + 1 = 0. O 

Let C's be the 2-constraint on G'g defined, for all {a,x) e E x Fp and 
{C,y)eSx¥p,hy 

C's{{a,x)) = {{a,x),{a,x + l)}, 
C's{{C,y)) = {{C,y + 1)}. 

Example. Obviously C'5( (a, 0)) = C's((a, 1)) — {(a, 0), (q, 1)}, and similarly 
for /3 and 7. For the other two elements of A'g we have: 

C's{{C,0)) - {(C,1)} and C's((C,l)) = {(C,0)}. 

O 

Lemma 12 5" is 1-satisfiable iff C's is satisflable in G'g. 
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Proof. Assume S is 1-satisfiable, let / be an interpretation of S and consider 
u G Fp defined by u{a) = 1 ii a G I and otherwise. Tlie constraint on any 
{a,x) e S X Fp is satisfied since {a,x)^ ("^ — {a,x + u{a)) e C's{{o:,x)). For 
all {C,y) G S* X Fp there is a unique f3 G C such that m(/3) = 1 and m|c is zero 
elsewhere, hence (C,y)/'(-) = (C, 2/ + E/3gc "(/?)) = (C,y + 1) £ C's((C,y)). 
This shows that f'{u) G G'g satisfies C s- 

Conversely, suppose that an element f'{u) of G'g satisfies C s and let 
/ = {a e E \u{a) = 1}. Then u{a) ^ 1 for all a G I]\/, and u(a) G {0, 1} since 
{a,x)f (") G C s{{oi,x)), thus w(a) — (modulo p). Let C be a clause in S, 
the constraint yields {C,0)^ '^"•' = (C', f), hence X]flec'"(''') ~ -'^- "^^^ terms of 
this sum are either or 1, hence at least one must be a 1. Furthermore, there 
are at most p terms, hence only one can be a 1, say u{a) — f, then by defi- 
nition of u, a is the only member of C that belongs to /. Hence / 1-satisfies 5". O 



Theorem 13 For any prime p > 3, the problem of solving 2-constraints in 
¥p-vector spaces is NP-complete. 

7 Experimental results 

The polynomial algorithm for solving linear constraints has been implemented 
in the GAP system, using its facilities on permutations, matrix algebra and 
finite fields. The implementation, nicknamed Solvect (see "downloads" page 
on capp.imag.fr), is straightforward except for the fact that coordinates in 
the transitive constituents are kept in memory and hence computed only once 
(this is performed while computing the orbit partition of G). Its performance 
has been measured against a general purpose group search algorithm provided 
in GAP, described in [6j and refined in ^llj . The call to this algorithm is 

ElementProperty([(7i, . . . , f^ml , 5 i— !■ Va G A, a^ G C{a) ); 

which returns an element of G = [51, . . . , gm\ satisfying the specified property 
if there is one, and fail otherwise. 

The performance of ElementProperty depends essentially on the size of G, 
while Solvect depends mostly on n = \A\ and to a lesser extent on d = dimi<". 
We thus perform two sets of experiments: the first in Table [T] is parametrized 
by the size of G (which is 2'^'™'') and the second in Table [2] is parametrized 
by n. In each case we measure the mean values of n, dim G and d as well as 
the times in milliseconds taken by the two solvers (ii for Solvect and t2 for 
ElementProperty) . 

The samples are generated by choosing randomly the number and dimensions 
of transitive constituents, i.e., a sequence di, . . . ,dq of strictly positive integers, 
then computing generators for the transitive constituents and composing them 
randomly to produce generators for G. In the first experiment we guarantee that 
G has the correct dimension, bounded by max^^-^ dj < dim G < X]?=i '^i ~ '^■ 
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diniG 


n 


d 


<i 


t2 


5 


26.4 ± 56% 


6.9 ± 29% 


0.384 ±310% 


0.82 ± 214% 


10 


270 ± 150% 


14.9 ± 26% 


1.46 ±137% 


20.3 ± 56% 


15 


785 ± 277% 


27.2 ± 28% 


5.4 ± 143% 


631 ± 106% 


20 


1 060 ± 229% 


35.4 ± 28% 


10.1 ±100% 


19 900 ± 90% 


25 


2 230 ±175% 


52.2 ±31% 


25.8 ± 73% 


— 


30 


2 730 ± 148% 


67.4 ± 34% 


45.3 ± 66% 


— 


35 


2 870 ± 107% 


79 ± 35% 


65.3 ± 67% 


— 


40 


8 510 ±94% 


94.2 ± 38% 


147 ± 69% 


— 


45 


7 680 ±77% 


125 ± 37% 


241 ± 64% 


— 


50 


12 800 ± 60% 


147 ± 39% 


436 ± 75% 


- 



Table 1: Experiment 1 

In the second experiment we guarantee that X]?=i 2''' = n. 2-constraints are 
also generated randomly, with the following bias: half of them are guaranteed 
to be satisfiable (a solution is chosen randomly in G). Another bias has been 
introduced: we choose the di between 1 and 13, because computing generators 
for an orbit of a size greater than 2^^ takes too much time. 

Since our random samples are by no means supposed to be representative, 
we also measure the standard deviation expressed as a percentage of the mean 
valuo We test 1000 instances on the low values and 100 on the higher ones. 
Values are rounded to 3 digits. We see that ElementProperty can hardly be 
used on groups of size much bigger than 2^'^, while Solvect works well up to 
the limits of the memory used by GAP (the limit is reached with n = 2^^). 

8 Conclusion and perspectives 

We can therefore solve fc-constraints in the class of Fp- vector spaces in guaran- 
teed polynomial time only when k = p = 2, and we have provided an algorithm 
to do so. For greater values of k and p the problem is NP-complete, which is 
quite surprising considering the rich structure of vector spaces and the relative 
simplicity of the constraints that were considered. These results confirm how 
difficult it can be to develop efficient algorithms for finding useful symmetries. 

However, our algorithm may be used on linear constraints regardless of k 
and p, and other experiments with Solvect suggests that many constraints are 
linear. Furthermore, checking the linearity of the constraint is fast. 

But this still requires the group to be an elementary Abelian p-group, which 
seems unlikely unless the problem under consideration is of a geometric nature, 
for instance if a hypercube is involved (its group of symmetries is an elementary 



^When a process takes less than 4 ms, GAP measures its duration as either or 4 ms, 
hopefully with a probability depending on this duration. In that case the mean value should 
be accurate, but standard deviation is obviously exaggerated. 
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n 


dimG 


d 


ti 


t2 


2' 


1 


1 


0.108 ±600% 


0.28 ± 375% 


22 


1.81 ±21% 


2 


0.144 ±517% 


0.308 ±351% 


23 


2.96 ± 20% 


3.68 ± 13% 


0.212 ±431% 


0.444 ± 294% 


2^^ 


4.44 ± 19% 


6.39 ± 22% 


0.344 ± 342% 


0.756 ±211% 


2^ 


6.12 ±19% 


10.4 ± 30% 


0.716 ± 224% 


1.88 ±139% 


2^ 


8.05 ± 19% 


16.2 ±35% 


1.4 ±141% 


7.05 ± 145% 


2^ 


10 ± 19% 


24.4 ± 36% 


3.08 ± 92% 


35.8 ± 188% 


2« 


12 ± 19% 


34.1 ±40% 


6.96 ± 79% 


184 ± 277% 


2^ 


14.5 ± 18% 


51.1 ±33% 


17.5 ± 74% 


1 740 ± 322% 


2io 


16.6 ± 17% 


67.6 ± 34% 


34.9 ±81% 


15 700 ±426% 


2" 


18.5 ±17% 


85.2 ± 37% 


64.4 ± 69% 


37 600 ±166% 


212 


20.3 ± 15% 


113 ±37% 


144 ± 88% 


187 000 ±315% 


213 


23.3 ± 12% 


131 ± 32% 


209 ± 66% 


— 


214 


25.8 ±9% 


189 ± 26% 


593 ± 80% 


— 


215 


28.8 ± 8% 


268 ± 23% 


1 520 ± 60% 


— 


216 


30.9 ± 7% 


446 ± 17% 


6 910 ±55% 


- 



Table 2: Experiment 2 

Abelian 2-group). In general it would be necessary to enforce this property by 
approximating the group of symmetries by one or several elementary Abelian 
p-subgroups. This is reasonable since symmetry pruning is meant to be fast, 
not complete with respect to just any group of symmetries. It seems possible to 
generalize these results to the class of Abelian groups in the line of \7\ , at the 
expense of our elegant geometric interpretation of linear constraints. We are 
currently investigating this generalization. 

Identifying tractable restrictions of the generally intractable problem of find- 
ing selected symmetries is therefore a natural approach to efficient symmetry 
pruning. Methods using only special symmetries have already been tried with 
some success, as in [9] where only transpositions are considered. We therefore 
believe the present results open interesting perspectives. 
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